Communication terminal, method for controlling communication terminal, communication system, and storage medium

ABSTRACT

A communication terminal acquires a captured image from an image capturing apparatus and transfers the captured image to a server. The communication terminal includes: a first communication unit configured to communicate with the image capturing apparatus; a second communication unit configured to communicate with the server; a display unit configured to communicate with the image capturing apparatus, acquire, from the image capturing apparatus, a captured image managed by the image capturing apparatus, and display the captured image; a sound input unit configured to accept sound input for generating a terminal sound memo related to the displayed captured image; a storage unit configured to store the generated terminal sound memo in association with the captured image; and a control unit configured to perform control so that the second communication unit transfers, to the server, the acquired captured image and the generated terminal sound memo associated with the captured image.

BACKGROUND Field

The present disclosure relates to a technique for transferring an image,which is captured by an image capturing apparatus, such as a digitalcamera, to a server on the Internet, via a communication terminal, suchas a smartphone, which is connected to the image capturing apparatus.

Description of the Related Art

With the development of digital image capturing techniques andcommunication techniques, mass media such as news media and newsagencies require a system that enables rapid transmission, by using acommunication technique, of a large number of images captured in thefield to a base where those images are edited, the system alsoperforming editing and processing to output the resulting images on amedium. Therefore, the mass media need an image capturing apparatushaving an advanced communication function and need a communicationtechnique that is usable at a high speed in a wide area.

Although recent digital cameras have wireless communication functions,such functions are based on near field communication. Therefore, fortransmitting data obtained in the field, such as images, from remotelocations to a base where such data can be edited, equipment, such aspersonal computers or access points, which are used to connect to theInternet, and a communication environment are needed. Thus, ageographical area where such data can be collected in the field andtransmitted to a base by using conventional digital cameras is limited.

Rapid transmission of gathered data by utilizing public wirelesscommunication systems, from any region of a country to a base where thedata is edited, via a mobile communication terminal such as asmartphone, has been examined. In the future, Fifth Generation (5G)communication is anticipated to become widespread, and communicationenvironments in which large-capacity data is transmitted at high speedwith short delays are expected to become available, and utilization ofmobile communication terminals may enable more rapid transmission ofsuch large-capacity data to a base where the data is edited.

In addition, there is a technique by which, when a still image or amoving image is captured by a digital camera, a user records sound dataas a sound memo in association with image data, thereby organizing andassembling information captured. Japanese Patent Laid-Open No.2018-186467 discloses a technique by which a user records a sound memowith a captured image by using a sound memo input function of a digitalcamera.

In the technique disclosed in Japanese Patent Laid-Open No. 2018-186467,however, the sound memo is able to be recorded only by an imagecapturing apparatus having a sound memo input function, and other imagecapturing apparatuses that do not have the sound memo input function arenot able to record the sound memo.

SUMMARY

According to various embodiments of the present disclosure, even in animage capturing apparatus that does not have a sound memo inputfunction, a sound memo is able to be recorded by a communicationterminal connected to the image capturing apparatus, and when an imagein the image capturing apparatus is transmitted to a server via thecommunication terminal, the associated sound memo recorded in thecommunication terminal is also able to be transmitted.

According to one embodiment of the present disclosure, a communicationterminal is provided that acquires a captured image from an imagecapturing apparatus and transfers the captured image to a server. Thecommunication terminal includes: a first communication unit configuredto communicate with the image capturing apparatus; a secondcommunication unit configured to communicate with the server; a displayunit configured to communicate with the image capturing apparatus byusing the first communication unit, acquire, from the image capturingapparatus, a captured image managed by the image capturing apparatus,and display the captured image; a sound input unit configured to acceptsound input for generating a terminal sound memo related to the capturedimage displayed by the display unit; a storage unit configured to storethe generated terminal sound memo in association with the capturedimage; and a control unit configured to perform control so that thesecond communication unit transfers, to the server, the captured imageacquired from the image capturing apparatus by using the firstcommunication unit and the generated terminal sound memo associated withthe captured image.

Further features will become apparent from the following description ofexemplary embodiments with reference to the attached drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a view for explaining an example of a configuration of asystem that includes communication equipment and an image capturingapparatus according to one embodiment of the present disclosure.

FIG. 2 is a block diagram illustrating an example of a hardwareconfiguration of the communication equipment according to one embodimentof the present disclosure.

FIG. 3A is a block diagram illustrating an example of a configuration ofthe image capturing apparatus according to the embodiment of presentdisclosure, and FIGS. 3B and 3C are views each illustrating an exampleof an external appearance of the image capturing apparatus according toone embodiment of the present disclosure.

FIG. 4 is a schematic view illustrating an example of an operationscreen of an image transfer application according to a first embodimentof the present disclosure.

FIG. 5 is a schematic view illustrating an example of a confirmationscreen of the operation screen of the image transfer applicationaccording to the first embodiment of present disclosure.

FIG. 6 is a schematic view illustrating an example of a setting screenof the image transfer application according to the first embodiment ofthe present disclosure.

FIG. 7 is a flowchart illustrating an example of a flow of processing ofthe image transfer application according to the first embodiment of thepresent disclosure.

FIG. 8 is a schematic view illustrating an example of a setting screenof an image transfer application according to a second embodiment of thepresent disclosure.

FIG. 9 is a flowchart illustrating an example of a flow of processing ofstarting transmission when a camera sound memo is updated according tothe second embodiment of the present disclosure.

FIG. 10 is a flowchart illustrating an example of a flow of processingof starting transmission when a terminal sound memo is updated accordingto the second embodiment of the present disclosure.

FIG. 11 is a block diagram illustrating a concept of a transfermanagement table of an image transfer application according to a thirdembodiment of the present disclosure.

FIG. 12 is a flowchart illustrating an example of a flow of processingof the image transfer application according to the third embodiment ofthe present disclosure.

FIG. 13 is a schematic view illustrating an example of a setting screenof an image transfer application according to a fourth embodiment of thepresent disclosure.

FIG. 14 is a flowchart illustrating an example of a flow of processingof the image transfer application according to the fourth embodiment ofthe present disclosure.

DESCRIPTION OF THE EMBODIMENTS

Various embodiments of the present disclosure will be described indetail below with reference to the accompanying drawings. However,components described in the described embodiments are merely examples,and the scope of the invention is not intended to be limited only to theembodiments.

First Embodiment

(System Configuration)

FIG. 1 is a schematic view of a system configuration of a system thatincludes communication equipment and an image capturing apparatusaccording to one embodiment of the present disclosure.

In FIG. 1 , communication equipment 101 operating as a mobile terminalis an information processing apparatus that is represented by asmartphone or a computer and that has a communication function. An imagecapturing apparatus 102 is a digital camera or the like. A server 103 isan image storage apparatus installed at a destination of a connectionachieved with the communication equipment 101 via a communicationnetwork such as the Internet. In FIG. 1 , the communication equipment101 and the image capturing apparatus 102 are connected by a wired orwireless communication system via a connection unit 104 such as a cableor Wi-Fi, and a still image generated by the image capturing apparatus102 or a sound file recorded to accompany the still image is able to betransferred to the communication equipment 101. That is, when thecommunication equipment 101 and the image capturing apparatus 102communicate via the connection unit 104, they function as acommunication system. Further, the communication equipment 101 and theserver 103 are connected through a communication unit 105 such as amobile network, and information temporarily saved in the communicationequipment 101 is able to be transferred to the server 103 via thecommunication unit 105.

(Communication Equipment Configuration)

FIG. 2 is a block diagram illustrating an example of a hardwareconfiguration of the communication equipment 101 according to thepresent embodiment.

In FIG. 2 , a housing 200 is a housing of the communication equipment101. A display apparatus 201 such as an LCD (liquid crystal display) oran EL (electroluminescent) display that displays an operation screen ora state screen of the communication equipment 101 is constituted by atouch panel and is used additionally as a pointing device by detecting auser input position, for example, in accordance with contact with afinger, a stylus pen, or the like. It is also possible to provide asoftware keyboard by sensing a position of a character key by usingimage formation in a keyboard form displayed on the display apparatus201.

In a VRAM (video RAM) 202, an image to be displayed on a display screenof the display apparatus 201 is drawn. The image generated in the VRAM202 is transferred to the display apparatus 201 in accordance with apredetermined rule so that the image is displayed on the displayapparatus 201.

A built-in camera 203 as an optical image information input apparatusattached to the communication equipment 101 receives input of, forexample, image information of a still image, a moving image, or the likeor fingerprint authentication information via a touch sensor or thelike.

A central processing unit (CPU) 204 controls respective devices, whichare connected via a bus 212, on the basis of control programs stored ina read-only memory (ROM) 205 and storage 207.

A ROM 205 holds various control programs and data.

A RAM (random access memory) 206 provides a work area for the CPU 204, adata saving area used during error processing, an area for loading acontrol program, and the like. In the system of the present embodiment,various programs for controlling the system are loaded.

The storage 207 is data storage in the information processing apparatus101 of the present disclosure in which electronic information isrecorded and saved and corresponds to a hard disk drive (HDD), a solidstate drive (SSD) constituted by flash memory, a hybrid drive using botha hard disk and flash memory, a memory card, a memory card readingdevice, or the like and also includes a large-capacity external harddisk drive HDD connected through a USB interface 208 described later.

A USB (Universal Serial Bus) interface 208 is connected to an externaldevice such as a USB memory, a CD (Compact Disc) drive, or a DVD(Digital Versatile Disc) drive so that it is possible to expandfunctions of the information processing apparatus.

A communication interface 209 is an interface for communication withother information processing apparatuses, a printer, a server systemapparatus on the Internet, or the like by wired communication, wirelesscommunication, or the like. A wireless communication unit using thecommunication interface 209 also includes a simple communicationfunction with low power consumption, and examples thereof include BLE(Bluetooth® Low Energy) and NFC (Near Field Communication).Communication with the image capturing apparatus 102 having the wirelesscommunication function is also performed by the communication interface209.

A mobile telephone interface 210 is an interface module for mobiletelephone communication and enables communication processing to beperformed according to a mobile telecommunication standard, such as 3Gor LTE. The mobile telephone interface 210 also includes an antennadevice that is needed for input and output of radio waves for thecommunication processing.

A GPS (global positioning system) interface 211 that is a module for GPScaptures radio waves from a plurality of GPS satellites by using anantenna function provided in the module and causes the CPU 204 toperform default calculation processing, thus making it possible tospecify location information of the communication equipment 101.

Moreover, though not illustrated, the communication equipment 101 has amicrophone as a sound input unit configured to receive input sound andenables sound data input through the microphone to be stored.

In the present embodiment, a control program processed by the CPU 204may be provided from the ROM 205, the RAM 206, or the data storage 207or from another information processing apparatus or the like by usingthe USB interface 208, the network interface 209, or the mobiletelephone interface 210.

(Image Capturing Apparatus Configuration)

FIG. 3A is a block diagram illustrating an example of a configuration ofthe image capturing apparatus 102 of the present embodiment.

A control unit 301 controls respective constituent units of the imagecapturing apparatus 102 in accordance with an input signal or a runningapplication. Note that, instead of the control unit 301 controlling theentire apparatus, a plurality of pieces of hardware may control theentire apparatus by sharing processing.

An image capturing unit 302 includes, for example, an optical lens unit,an optical system that controls a diaphragm, zooming, focusing, and thelike, and an image capturing element for converting light (video), whichis introduced via the optical lens unit, into an electrical videosignal. A CMOS (complementary metal-oxide semiconductor) or a CCD(charge coupled device) is generally used as the image capturingelement. Under control of the control unit 301, the image capturing unit302 converts subject light focused by a lens included in the imagecapturing unit 302 into an electrical signal by using the imagecapturing element, performs noise reduction processing or the like, andoutputs image data as digital data. In the image capturing apparatus 102of the present embodiment, the image data is recorded on a recordingmedium 310 in accordance with a DCF (Design rule for Camera File system)standard.

A non-volatile memory 303 is an electrically erasable/recordablenon-volatile memory and stores an application program executed by thecontrol unit 301 or the like.

A work memory 304 is used as a buffer memory that temporarily holds dataof an image captured by the image capturing unit 302, as an imagedisplay memory for a display unit 306, a work area for the control unit301, or the like.

An operation unit 305 is used to accept an instruction for the imagecapturing apparatus 102 from the user. The operation unit 305 includes,for example, a power button for giving an instruction from the user toswitch ON/OFF power of the image capturing apparatus 102, a releaseswitch 305 a for giving an image capturing instruction, a reproducebutton 305 b for giving an instruction to reproduce image data, and thelike. The operation unit 305 further includes an operation member suchas a connection button dedicated to starting communication with externalequipment via a communication unit 311 described later. The operationunit 305 also includes a touch panel 305 d forming the display unit 306described later.

Note that, the release switch 305 a includes an SW1 and an SW2. The SW1is switched ON when the release switch 305 a is in a so-calledhalf-pressed state. Thereby, an instruction for preparing for imagecapturing, such as AF (autofocus) processing, AE (automatic exposure)processing, AWB (automatic white balance) processing, and EF (flashpreliminary emission) processing, is accepted. The SW2 is switched ONwhen the release switch 305 a is in a so-called full-pressed state.Thereby, an instruction for capturing an image is accepted.

The display unit 306 displays a viewfinder image during image capture,captured image data, text for user interaction, or the like. Note that,the image capturing apparatus 102 does not necessarily include thedisplay unit 306. It is only necessary that the image capturingapparatus 102 be connectable to an internal or external display unit 306and have at least a display control function for controlling display ofthe display unit 306.

The recording medium 310 is able to record image data output from theimage capturing unit 302. The recording medium 310 may be configured tobe detachable from the image capturing apparatus 102 or may be includedin the image capturing apparatus 102. That is, it is only necessary thatthe image capturing apparatus 102 have at least a way of accessing therecording medium 310.

The communication unit 311 is an interface that enables an externalapparatus to be connected. The image capturing apparatus 102 of thepresent embodiment is able to exchange data with an external apparatusvia the communication unit 311. For example, image data generated by theimage capturing unit 302 is able to be transmitted to an externalapparatus via the communication unit 311. In addition, it is possible toremotely control image capturing by the image capturing unit 302 from anexternal apparatus via the communication unit 311. Note that, in thepresent embodiment, the communication unit 311 includes an interface forcommunicating with an external apparatus by using a so-called wirelessLAN that conforms to the IEEE 802.11 standard. For using an externalapparatus, the communication unit 311 also includes a USB (UniversalSerial Bus) interface and a USB cable. The control unit 301 implementswireless communication with an external apparatus by controlling thecommunication unit 311.

A sound input unit 307 is a device by which sound information to beadded to image data recorded on the recording medium 310 is input, andsound data information converted from the sound information by the soundinput unit 307 is recoded on the recording medium 310 in a sound fileformat as a sound memo (camera sound memo) associated with the imagedata.

Note that the communication unit 311 of the image capturing apparatus102 in the present embodiment has an AP (access point) mode in which thecommunication unit 311 operates as an access point in an infrastructuremode, and a CL (client) mode in which the communication unit 311operates as a client in the infrastructure mode. In addition, by causingthe communication unit 311 to operate in the CL mode, the imagecapturing apparatus 102 in the present embodiment is able to operate asCL (client) equipment in the infrastructure mode. When the imagecapturing apparatus 102 operates as the CL equipment, establishing aconnection with peripheral AP (access point) equipment enablesparticipation on a network formed by the AP equipment. Furthermore, bycausing the communication unit 311 to operate in the AP mode, the imagecapturing apparatus 102 in the present embodiment is also able tooperate as a simplified AP (hereinafter, referred to as a simple AP),which is one type of AP, but with more limited functionality. When theimage capturing apparatus 102 operates as the simple AP, the imagecapturing apparatus 102 forms a network by itself. A peripheralapparatus of the image capturing apparatus 102 recognizes the imagecapturing apparatus 102 as AP equipment and is able to participate onthe network formed by the image capturing apparatus 102. It is assumedthat a program for causing the image capturing apparatus 102 to operateas described above is held in the non-volatile memory 303.

Note that, although the image capturing apparatus 102 in the presentembodiment is one type of AP, the image capturing apparatus 102 is asimple AP that does not have a gateway function for transferring datareceived from CL equipment via an Internet service provider or the like.Therefore, even when receiving data from another apparatus participatingon a network formed by the image capturing apparatus 102, the imagecapturing apparatus 102 does not transfer data to a network such as theInternet.

Next, the external appearance of the image capturing apparatus 102 willbe described. FIGS. 3B and 3C are views each illustrating an example ofthe external appearance of the image capturing apparatus 102. Therelease switch 305 a, the reproduce button 305 b, a direction key 305 c,the touch panel 305 d, and a recording button 305 e are operationmembers included in the operation unit 305 described above. In addition,the display unit 306 displays an image obtained as a result of imagecapturing by the image capturing unit 302. The image capturing apparatus102 of the present embodiment also has a USB cable connection terminal311 in a side surface of a camera housing. When the USB cable isconnected to the USB/IF 208 of the communication equipment 101, thecommunication equipment 101 and the image capturing apparatus 102 areable to use high-speed data communication according to a USB standard.

(Image Transfer Application Display Screen)

Next, an aspect of an image transfer application which is executed bythe communication equipment 101 in the present embodiment and by whichimage data stored in the image capturing apparatus 102 is transferred tothe server 103 will be described with reference to FIG. 4 .

FIG. 4 is a schematic view illustrating a display state of an operationscreen of the image transfer application and a function thereof. In FIG.4 , a screen 401 is an image selection screen on which a list ofthumbnails 402 to 406 acquired from an image file saved to the recordingmedium 310 of the image capturing apparatus 102 is displayed. When aportion in which each of the thumbnails 402 to 406 is displayed on thescreen 401 is tapped, the thumbnail is able to be switched between aselected state and a deselected state. The selected state and thedeselected state of the thumbnail are distinguishable from each otherdue to having different graphical presentations, and a thumbnail in theselected state is a target of recording or transfer described later. InFIG. 4 , the thumbnails 403 and 406 are in the selected state and thethumbnails 402, 404, and 405 are in the deselected state. For example,when colors, thicknesses, or the like of frames of thumbnail images aredifferentiated, the user is able to distinguish between the selectedstate and the deselected state.

An image 404 is an example of thumbnail display of an image fileaccompanied by a camera sound memo and indicates by icon display (camerasound memo icon 407) that the sound memo associated with the image 404is saved to the recording medium 310 upon a press of the recordingbutton 305 e of the image capturing apparatus 102.

An image 405 is an example of thumbnail display of an image fileaccompanied by a terminal sound memo and indicates by icon display(terminal sound memo icon 408) that the sound memo (terminal sound memo)recorded in the communication equipment 101 and associated with theimage 405 is saved to the recording medium 310.

Note that, in an image 406, both the camera sound memo icon 407 and theterminal sound memo icon 408 are displayed, indicating that sound memosof both accompany an image file. In the present embodiment, bydisplaying different icons in accordance with a type of sound memo, theuser is able to distinguish the type of the sound memo accompanying theimage file but may also be able to distinguish the type by anotherdisplay method.

In FIG. 4 , when a return button 409 on the screen 401 is tapped,selection states of all images displayed on the screen 401 are discardedand the image transfer application ends. When a recording button 410 istapped in a state where an image is selected on the screen 401,recording for a fixed time starts and a sound memo is generated andsaved to the storage 207. The generated sound memo is recorded inassociation with an image that is in the selected state on the screen401, and the terminal sound memo icon 408 is displayed on a thumbnail.Note that, display of an image on which the terminal sound memo icon 408has already been displayed does not change.

Next, when a transfer button 411 is tapped in a state where an image isselected on the screen 401, the communication equipment 101 obtains,from the image capturing apparatus 102, an image file of an originalimage of each image in the selected state on the screen 401 andtransfers the image file to the server 103 that is set as a transferdestination. When a camera sound memo has been associated with an imageto be transferred, the communication equipment 101 also obtains thecamera sound memo from the image capturing apparatus 102 and transfersthe camera sound memo to the server 103. Further, when a terminal soundmemo is associated with an image to be transferred, the communicationequipment 101 also transfers the terminal sound memo saved to thestorage 207 to the server 103. Note that, the original image obtainedfrom the image capturing apparatus 102 here is an image having higherquality than that of an image displayed as a thumbnail.

Note that, when a setting button 412 on the screen 401 is tapped, asetting panel for setting instructions for operation of the imagetransfer application is opened.

FIG. 5 illustrates an example of a confirmation screen displayed whenthe image 406 associated with both a camera sound memo and a terminalsound memo is transferred to the server 103. When the transfer button411 is tapped in a state where the image 406 is selected, a confirmationpanel 501 is displayed in an overlaid manner. Selection buttons 502 to505 are arranged on the confirmation panel 501 and the respectiveselection buttons correspond to sound memos to be transferred. That is,when the selection button 502 is tapped, only the camera sound memo isto be transferred. When the selection button 503 is tapped, only theterminal sound memo is to be transferred. When the selection button 504is tapped, both the camera sound memo and the terminal sound memo are tobe transferred. When the selection button 505 is tapped, no sound memois to be transferred. The confirmation panel 501 enables the user toselect a sound memo, which is to be transferred, in accordance withintended use.

An operation screen of the image transfer application illustrated ineach of FIGS. 4 and 5 is an example and another display method may beused. Note that, although an image transfer function and a sound memorecording function of the image transfer application have beendescribed, the present application may have another function. The imagetransfer application may enable IPTC (International PressTelecommunications Council) information registered in an image in theimage capturing apparatus 102 to be displayed by the communicationequipment 101 or may enable the IPTC (International PressTelecommunications Council) information to be edited in thecommunication equipment 101 and registered in an image in the imagecapturing apparatus 102.

(Setting Panel)

In a case where both a terminal sound memo and a camera sound memo existfor an image to be transferred and the sound memos are overlapped, theuser may be able to perform selection with use of the confirmation panel501 as described above or setting may be performed in advance.

FIG. 6 is a schematic view illustrating an example of a sound memosetting portion in a setting screen of the image transfer application inthe present embodiment. For example, in FIG. 4 , when the setting button412 in the screen 401 is tapped, a setting panel 601 is displayed.Although various items are considered as setting items in addition to asound memo, the sound memo will be described here and description forother setting items will be omitted.

Items (check boxes) 602 to 605 with radio buttons are displayed on thesetting panel 601. The items 602 to 605 are contradictory, and when anyone of them is selected, the other items are cancelled. That is, controlis performed so that a plurality of items are not valid at the sametime.

When the item 602 is selected, both the sound memos are to betransferred at a time of image transfer. When the item 603 is selected,only the camera sound memo is to be transferred, and when the item 604is selected, only the terminal sound memo is to be transferred. Further,when the item 605 is selected, no sound memo is transferred.

Note that, by adjusting an item content, similar control is able to beapplied even in a case where there is only any one of sound memos forthe image. In addition, the setting panel 601 is an example and a screendisplay method of the image transfer application and a content of anoption are not limited thereto.

(Transfer Processing Flow)

Next, a flow of processing of transferring an image and a sound memo ofthe image transfer application in the present embodiment will bedescribed with reference to a flowchart of FIG. 7 .

In FIG. 7 , when an instruction to transfer an image is accepted at stepS701 upon a press of the transfer button 411, original images of theimages 403 and 406 that are in the selected state on the operationscreen are received from the image capturing apparatus 102, and thereceived images are transferred to the server 103 at step S702.

Whether or not a camera sound memo accompanies each of the transmittedimages is checked at step S703, and whether or not a terminal sound memoaccompanies each of the images is checked at step S704 or step S711.

In a case where both the camera sound memo and the terminal sound memoaccompanies the same image to be transmitted at step S704, setting oftransfer when the sound memos are overlapped is checked at step S705.For example, the setting of transfer is checked in a state where settingis performed in the transfer setting panel 601 in FIG. 6 . Then, in acase where a result of checking the setting panel 601 indicates that thecheck box 602 is checked and an instruction to set both the sound memosis given at step S706, the terminal sound memo is transferred to theserver 103 at step S707, and then, the camera sound memo is receivedfrom the image capturing apparatus 102 at step S708, and the camerasound memo is transferred to the server 103 at step S709.

In a case where transfer of both the sound memos is not designated inthe transfer setting panel 601 at step S706, when transmission of thecamera sound memo in the check box 603 is instructed at step S710, it isdetermined that priority is given to the camera sound memo, and thecamera sound memo is received from the image capturing apparatus 102 atstep S708 and the camera sound memo is transferred to the server 103 atstep S709. When transmission of the terminal sound memo in the check box604 is instructed at step S710, it is determined that priority is givento the terminal sound memo, and the terminal sound memo is transferredto the server 103 at step S712.

In a case where it is determined that the camera sound memo exists andthat no terminal sound memo exists at step S704, the camera sound memois received from the image capturing apparatus 102 at step S708 and thecamera sound memo is transferred to the server 103 at step S709. In acase where it is determined that no camera sound memo exists and thatthe terminal sound memo exists at step S711, the terminal sound memo istransferred to the server 103 at step S712. Note that, in a case whereit is determined that no camera sound memo or terminal sound memo existsat step S711, the procedure ends as it is.

By following the flow illustrated in FIG. 7 , in the image transferapplication in the present embodiment, in accordance with an addingsituation of a sound memo to an image to be transferred and a settingstate of the user, transfer of the sound memo is able to be controlledin accordance with a condition. Note that, order of transfer of imagesand sound memos, a determination timing, and the like are not limited tothe example of FIG. 7 , and are able to be appropriately changed for adesign reason.

Other embodiments of the present disclosure will be described below withreference to drawings. In each of the embodiments, the configuration(FIG. 1 ) of the system, the configuration (FIG. 2 ) of thecommunication equipment 101, and the configuration (FIG. 3A) of theimage capturing apparatus 102 are the same as those of the firstembodiment, so that description thereof will be omitted.

Second Embodiment

As a second embodiment of the present disclosure, processing in which,when an operation of newly generating a camera sound memo or a terminalsound memo and adding the resultant to an image is performed, transferof the image starts upon the operation will be described.

FIG. 8 is a schematic view illustrating a configuration of a transfersetting panel 801 of a sound memo in an image transfer application thatruns in the communication equipment 101 in the present embodiment.

In FIG. 8 , items 802 to 805 with radio buttons are displayed on thetransfer setting panel 801. Functions of the respective radio buttonsare equivalent to those of the items 602 to 605 with the radio buttonsin FIG. 6 . With a check button 806, switching on and off of check areable to be switched. In a state where the check of the check button 806is switched on, setting is performed so that transfer of an image startswhen a state changes, for example, due to a sound memo generated andadded in the image capturing apparatus 102 or the communicationequipment 101.

Next, a flow of processing of transferring an image and a sound memo inthe present embodiment will be described with reference to a flowchartof FIG. 9 .

FIG. 9 is the flowchart for explaining transfer processing when a camerasound memo is added in the image capturing apparatus 102.

In FIG. 9 , when a sound memo is added in the image capturing apparatus102 at step S901, notification indicating that the sound memo is addedis transmitted from the image capturing apparatus 102 to thecommunication equipment 101 at step S902. Thereby, the communicationequipment 101 is able to detect the addition of the sound memo. At stepS903, when the image transfer application does not run on thecommunication equipment 101, the processing ends as it is, but when theimage transfer application runs and a state where an image is able to betransferred is provided, a state of the transfer setting panel 801 ischecked at step S904.

In a case where the check box 806 is in an off state at step S905, theprocessing ends without any change. On the other hand, in a case wherethe check box 806 is in an on state and setting to start update uponaddition of a sound memo is performed at step S905, an original image ofan image added with the sound memo is acquired from the image capturingapparatus 102 at step S906. Then, the acquired image is transferred tothe server 103 at step S907, and further, the added sound memo isacquired from the image capturing apparatus 102 at step S908, and thesound memo is transferred to the server 103 at step S911.

FIG. 10 is a flowchart for explaining transfer processing when aterminal sound memo is added in the communication equipment 101.

In FIG. 10 , in a case where it is detected at step S1001 that a soundmemo is added in the communication equipment 101, when the imagetransfer application does not run on the communication equipment 101 atstep S1002, the processing ends as it is, but when the image transferapplication runs and a state where an image is able to be transferred isprovided, a state of the transfer setting panel 801 is checked at stepS1003.

In a case where the check box 806 is in the off state at step S1004, theprocessing ends without any change. On the other hand, in a case wherethe check box 806 is in the on state and setting to start update uponaddition of a sound memo is performed at step S1004, an original imageof a captured image added with the sound memo is acquired from the imagecapturing apparatus 102 at step S1005. Then, the acquired image istransferred to the server 103 at step S1006, and further, the addedsound memo is transferred to the server 103 at step S1007.

In the present embodiment, transfer of an image starts upon a sound memonewly generated and added, but may start upon a change of a content ofthe sound memo, application or change of a rating value of the image,trimming editing of the image, or the like. Further, when a commonoperation is performed in the image capturing apparatus 102 and thecommunication equipment 101, transfer processing of an image may startupon the operation.

Third Embodiment

Next, as a third embodiment of the present disclosure, processing bywhich, when an instruction to transmit an image and a sound memo isgiven by the user, transfer processing is performed with a minimumamount of communication and a minimum time of communication bypreventing retransfer of data that has been already transmitted will bedescribed.

(Transfer Management Table Configuration)

FIG. 11 is a schematic view of a transfer management table for managinga transfer situation of an image in an image transfer application thatruns on the communication equipment 101. It is assumed that the transfermanagement table is generated on the basis of a list of images, whichare recorded on the recording medium 310 of the image capturingapparatus 102, when the communication equipment 101 and the imagecapturing apparatus 102 are connected firstly, is developed onto the RAM206 when the image transfer application runs, and is updated at any timein accordance with an operation of the image capturing apparatus 102,such as image capturing or deletion of an image.

In FIG. 11 , an image management table 1101 is constituted by the numberof images 1102 of all images recorded on the recording medium 310 of theimage capturing apparatus 102 and an information block 1103 of each ofthe images.

An image information block 1104 is an internal configuration of theinformation block 1103 of each of the images and holds, as informationabout the image, an image file name 1105, an image transfer date andtime 1106, and an image transfer flag 1107. The image file name 1105 isa character string of a file name of the image, the image transfer dateand time 1106 is information about date and time when the image istransferred last time, and the image transfer flag 1107 is a bit flagindicating whether or not the image is transferred.

The image information block 1104 also holds a camera sound file name1108, a camera sound file transfer date and time 1109, and a camerasound transfer flag 1110 as information about a camera sound memoassociated with the image and recorded on the recording medium 310 ofthe image capturing apparatus 102. The camera sound file name 1108 is acharacter string of a file name of the sound memo that accompanies theimage, the camera sound transfer date and time 1109 is information aboutdate and time when the camera sound memo is transferred, and the camerasound transfer flag 1110 is a bit flag indicating whether or not thecamera sound memo is transferred.

The image information block 1104 also holds a terminal sound file name1111, a terminal sound file transfer date and time 1112, and a terminalsound transfer flag 1113 as information about a terminal sound memoassociated with the image and recorded in the storage 206 of thecommunication equipment 101. The terminal sound file name 1111 is acharacter string of a file name of the sound memo that accompanies theimage, the terminal sound transfer date and time 1112 is informationabout date and time when the terminal sound memo is transferred, and theterminal sound transfer flag 1113 is a bit flag indicating whether ornot the terminal sound memo is transferred.

Since whether or not an image of the image capturing apparatus 102 istransferred to the server 103 is able to be determined by referring to arecord in the transfer management table, a non-transferred image is ableto be displayed through filtering on thumbnail display of the imagetransfer application. The display through filtering makes it easy tocollectively select non-transferred images and designate them astransfer targets.

(Processing Flow)

Next, a flow of processing of transferring a needed file to the server103 with use of the transfer management table by preventing retransferof an image or sound memo that has been already transferred in thepresent embodiment will be described with reference to a flowchart ofFIG. 12 .

In FIG. 12 , by referring to the image information block 1104 of thetransfer management table 1101 related to an image that is designated tobe transferred, a transfer history thereof is checked at step S1201.

At step S1202, whether or not the image has already been transferred isdetermined by referring to a state of the image transfer flag 1107.However, even in a case where the image transfer flag 1107 indicates analready transferred state, when the image transfer date and time 1106 isearlier as a result of comparison between the transfer date and time1106 and date and time when the image is updated, the image is regardedas being updated after the transfer and treated as not having beentransferred yet.

When the image is regarded as not having been transferred yet, the imageis captured from the image capturing apparatus 102 at step S1203 andtransferred to the server 103 at step S1204.

When it is determined at step S1202 that the image has been transferred,whether or not there is a camera sound memo is determined at step S1205in accordance with presence or absence of the camera sound file name1108 in the image information block 1104.

In a case where there is a camera sound memo at step S1205, whether thecamera sound memo is to be transferred and has already been transferredis determined on the basis of a setting state of the transfer settingpanel 601, the camera sound transfer flag 1110, and the camera soundtransfer date and time 1109 at step S1206. Although whether or nothaving already been transferred is recorded in a bit state in the camerasound transfer flag 1110, even in a case where the flag indicates thealready transferred state, when the camera sound transfer date and time1109 is earlier as a result of comparison between the transfer date andtime 1109 and date and time when a file is updated, the camera soundmemo is regarded as being updated after the transfer and is treated asnot having been transferred yet. In a case where the camera sound memois regarded as not having been transferred yet, the camera sound memo iscaptured from the image capturing apparatus 102 at step S1207 andtransferred to the server 103 at step S1208.

In a case where there is no camera sound memo at step S1205 or thecamera sound memo is determined as being not to be transferred or ashaving been already transferred at step S1206, the procedure proceeds tostep S1209 without transferring the camera sound memo.

Whether or not there is a terminal sound memo is determined at stepS1209 in accordance with presence or absence of the terminal sound filename 1111 in the image information block 1104. In a case where there isa terminal sound memo, whether the terminal sound memo is to betransferred and has already been transferred is determined on the basisof a setting state of the transfer setting panel 601, the terminal soundtransfer flag 1113, and the terminal sound transfer date and time 1112at step S1210. Although whether or not having already been transferredis recorded in a bit state in the terminal sound transfer flag 1113,even in a case where the flag indicates the already transferred state,when the terminal sound transfer date and time 1112 is earlier as aresult of comparison between the transfer date and time 1112 and dateand time when a file is updated, the terminal sound memo is regarded asbeing updated after the transfer and treated as not having beentransferred yet.

In a case where the terminal sound memo is regarded as not having beentransferred yet, a terminal sound file is transferred to the server 103at step S1211.

When any of the image, the camera sound memo, and the terminal soundmemo is transferred, the items of the image transfer date and time 1106,the image transfer flag 1107, the camera sound transfer date and time1109, the camera sound transfer flag 1110, the terminal sound transferdate and time 1112, and the terminal sound transfer flag 1113 in thetransfer management table 1101 are updated in accordance with a contentof the transfer at step S1212.

As a result, in the present embodiment, not only by recording a state ofa file that has been already transferred but also by comparingchronological order of date and time of transfer and date and time ofupdate, needed file information is able to be appropriately registeredin the server 103.

Further, when the transfer flows of a non-transferred image and anon-transferred sound memo described in the present embodiment are usedat a time of end of the image transfer application, a non-transferredimage or sound is able to be reliably transferred to the server 103without being discarded.

(Prevention of Overlapping File Names at Time of Retransfer)

Note that, in the present embodiment, even in a case of an image orsound memo that is transferred to the server 103 once, when beingupdated after previous transfer, it is determined that the image or thesound memo needs to be retransmitted, and the image or the sound memo isretransferred. A previous transfer file may be overwritten depending onsetting of the server 103, or the like. In order to cope with such aproblem, a method of changing a file name of the image or the sound memoat a time of retransfer and transferring the image or the sound memo tothe server 103 is able to be adopted.

In a case where it is determined at step S1202 in FIG. 12 thatretransfer is needed even though the image has been transferred, a filename different from the character string of the image file name 1105 isable to be generated and designated as a file name to be saved aftertransfer to the server 103. When the new file name generated at thistime is added and held in a region of the image file name 1105 of thetransfer management table 1101, an image file that has been transmittedat the previous transfer or before that is able to be prevented frombeing overwritten on the server 103. A character string with any lengthis expected to be input in the region of the image file name 1105, but adata configuration of a list format of a character string is desired sothat a plurality of file names are able to be additionally stored.

Though a method of avoiding overwriting of a file name of an image fileon the server 103 due to retransfer of an updated image has beendescribed here, a similar method is able to be adopted also for a soundmemo. That is, in a case of a camera sound memo, when it is determinedat step S1206 that retransfer is needed, a file name different from thecharacter string saved in the camera sound file name 1108 is generatedand designated as a file name to be saved after transfer to the server103, and the new file name to be saved is added to the camera sound filename 1108. In addition, in a case of a terminal sound memo, when it isdetermined at step S1209 that retransfer is needed, a file namedifferent from the character string saved in the terminal sound filename 1111 is generated and designated as a file name to be saved aftertransfer to the server 103, and the new file name to be saved is addedto the terminal sound file name 1111.

Such avoidance of overlapping of file names on the server 103 aftertransfer is able to be carried out by similar information managementalso in an embodiment other than the present embodiment.

Fourth Embodiment

Next, as a fourth embodiment of the present disclosure, a method of,when a camera sound memo and a terminal sound memo are overlapped,combining two sound files at a time of transmission and transferring theresultant will be described.

(Setting Panel)

FIG. 13 is a schematic view illustrating an example of a sound memosetting portion in a setting screen of an image transfer application inthe present embodiment and the setting portion has a function equivalentto that of the setting panel in FIG. 6 in the embodiment describedabove.

Selection items (check boxes) 1302 to 1305 with radio buttons aredisplayed on a setting panel 1301. The respective selection items arecontradictory, and control is performed so that, when any one of them isselected, selection of the other items is cancelled and two or moreitems are not valid at the same time.

In a case where the item 1302 is designated, when there are a terminalsound memo and a camera sound memo and the sound memos are overlapped ata time of transfer of an image, a combined sound memo generated bycombining sound files of both of them is to be transferred. In a casewhere the item 1303 is designated, only the camera sound memo is to betransferred. When the item 1304 is designated, only the terminal soundmemo is to be transferred. When the item 1305 is designated, no soundmemo is transferred.

Note that, even in a case where a setting panel in a form different fromthat in FIG. 13 is used, when special processing is able to bedesignated for overlapped sound memos at a time of transfer of the soundfiles, an equivalent effect is able to be obtained.

(Transfer Processing Flow)

Next, a flow of processing of transferring an image and a sound memo inthe image transfer application in the present embodiment will bedescribed with reference to a flowchart of FIG. 14 .

In FIG. 14 , when an instruction to transfer an image is accepted upon apress of the transfer button 411 at step S1401, original image data ofimages 403 and 406 that are in the selected state on the operationscreen is received from the image capturing apparatus 102 and receivedimages are transferred to the server 103 at step S1402.

Whether or not a camera sound memo accompanies each of the transmittedimages is checked at step S1403, and whether or not a terminal soundmemo accompanies each of the images is checked at step S1404 or stepS1413.

When a camera sound memo and a terminal sound memo accompany thetransmitted image at step S1404, setting of transfer when the soundmemos are overlapped is checked at step S1405 from a state that is setin the transfer setting panel 1301 of FIG. 13 . As a result of checkingthe setting panel 1301, when the check box 1302 is checked and aninstruction to combine sound files of both of them is given at stepS1406, the camera sound memo is captured from the image capturingapparatus 102 at step S1407 and the captured camera sound memo iscombined with a sound file of the terminal sound memo, which is saved inthe communication equipment 101, at step S1408. Then, a new sound filegenerated by the combining is transferred to the server 103 at stepS1409 and a series of processing ends.

When combining of sound files is not designated in the transfer settingpanel 1301 at step S1406 and an instruction to give priority to thecamera sound memo of the check box 1303 is given at step S1410, and whenit is determined at step S1404 that a camera sound memo exists but noterminal sound memo exists in the transmitted image, the camera soundmemo is received from the image capturing apparatus 102 at step S1411and the camera sound memo is transferred to the server 103 at stepS1412. When an instruction to give priority to the terminal sound memoof the check box 1304 is given at step S1410, and when it is determinedat step S1413 that no camera sound memo exists but a terminal sound memoexists in the image, the sound file of the terminal sound memo istransferred to the server 103 at step S1414.

When it is determined at step S1413 that neither a camera sound memo nora terminal sound memo exists in the transferred image, the processingends as it is without any change.

According to the example of the flow indicated above, in the imagetransfer application of the present embodiment, in accordance with anadding situation of a sound memo to an image to be transferred and asetting state of the user, transfer of a sound memo file is able to becontrolled, but order of transfer of an image file and a sound file, adetermination timing, and the like are not limited to the example ofFIG. 14 , and are able to be appropriately changed for a design reason.

Note that, although the communication equipment 101 has been describedin the embodiments described above by taking a smartphone as an example,the communication equipment 101 is not particularly limited as long asbeing equipment in which any application is able to be installed. Forexample, a mobile telephone, a tablet, a personal computer (PC), or thelike is similarly applicable as the communication equipment 101.Further, the communication equipment 101 does not necessarily include acommunication function and may be implemented by a wide variety ofinformation processing apparatuses.

Various embodiments of the present disclosure are also able to beachieved by processing in which a program that achieves one or morefunctions of the aforementioned embodiments is supplied to a system oran apparatus via a network or a storage medium, and one or moreprocessors in a computer of the system or the apparatus read out andexecute the program. Moreover, various embodiments of the presentdisclosure are also able to be achieved by a circuit (for example, anApplication Specific Integrated Circuit (ASIC) or a Field ProgrammableGate Array (FPGA)) that realizes one or more functions.

Other Embodiments

Embodiment(s) of the present disclosure can also be realized by acomputer of a system or apparatus that reads out and executes computerexecutable instructions (e.g., one or more programs) recorded on astorage medium (which may also be referred to more fully as a‘non-transitory computer-readable storage medium’) to perform thefunctions of one or more of the above-described embodiment(s) and/orthat includes one or more circuits (e.g., application specificintegrated circuit (ASIC)) for performing the functions of one or moreof the above-described embodiment(s), and by a method performed by thecomputer of the system or apparatus by, for example, reading out andexecuting the computer executable instructions from the storage mediumto perform the functions of one or more of the above-describedembodiment(s) and/or controlling the one or more circuits to perform thefunctions of one or more of the above-described embodiment(s). Thecomputer may comprise one or more processors (e.g., central processingunit (CPU), micro processing unit (MPU)) and may include a network ofseparate computers or separate processors to read out and execute thecomputer executable instructions. The computer executable instructionsmay be provided to the computer, for example, from a network or thestorage medium. The storage medium may include, for example, one or moreof a hard disk, a random-access memory (RAM), a read only memory (ROM),a storage of distributed computing systems, an optical disk (such as acompact disc (CD), digital versatile disc (DVD), or Blu-ray Disc (BD)™),a flash memory device, a memory card, and the like.

According to the embodiments described above, when a captured image istransmitted to a server via a communication terminal connected to animage capturing apparatus, a sound memo related to the captured image tobe transferred is also able to be transmitted to the server regardlessof whether or not the image capturing apparatus has a sound inputfunction, and organizing and assembling images after image capturing, orthe like is able to be effectively performed.

While exemplary embodiments have been described, it is to be understoodthat the invention is not limited to the disclosed exemplaryembodiments. The scope of the following claims is to be accorded thebroadest interpretation so as to encompass all such modifications andequivalent structures and functions.

This application claims the benefit of Japanese Patent Application No.2019-130283, filed Jul. 12, 2019, which is hereby incorporated byreference herein in its entirety.

What is claimed is:
 1. A communication terminal that acquires a capturedimage from an image capturing apparatus and transfers the captured imageto a server, the communication terminal comprising: a firstcommunication unit configured to communicate with the image capturingapparatus; a second communication unit configured to communicate withthe server; a display unit configured to communicate with the imagecapturing apparatus by using the first communication unit, acquire, fromthe image capturing apparatus, a captured image managed by the imagecapturing apparatus, and display the captured image; a sound input unitconfigured to accept sound input for generating a terminal sound memorelated to the captured image displayed by the display unit; a storageunit configured to store the generated terminal sound memo inassociation with the captured image; and a control unit configured toperform control so that the second communication unit transfers, to theserver, the captured image acquired from the image capturing apparatusby using the first communication unit and the generated terminal soundmemo associated with the captured image, wherein, in a case where both acamera sound memo generated by the image capturing apparatus and thegenerated terminal sound memo exist for the captured image that is to betransferred, the control unit determines based on a predeterminedcondition whether to transfer the terminal sound memo together with thecaptured image to the server or to transfer the camera sound memotogether with the captured image to the server, and wherein the firstcommunication unit, the second communication unit, the display unit, thesound input unit, the storage unit, and the control unit are implementedby one or more processors.
 2. The communication terminal according toclaim 1, wherein in a case where a captured image to be transferred isassociated with both a camera sound memo generated by the imagecapturing apparatus and the terminal sound memo generated by thecommunication terminal, the control unit performs control, in accordancewith a predetermined condition, so that the second communication unittransfers the terminal sound memo and the camera sound memo to theserver.
 3. The communication terminal according to claim 2, wherein thecontrol unit performs control so that the second communication unittransfers the terminal sound memo and the camera sound memo to theserver in accordance with a date and a time when a sound memo isupdated.
 4. The communication terminal according to claim 2, wherein thedisplay unit displays a setting screen on which the predeterminedcondition is set.
 5. The communication terminal according to claim 1,wherein in a case where it is detected that a camera sound memo is newlygenerated by the image capturing apparatus, the control unit controlsthe first communication unit and the second communication unit to startprocessing of acquiring a captured image and the generated camera soundmemo from the image capturing apparatus and transferring the capturedimage and the camera sound memo to the server.
 6. The communicationterminal according to claim 1, wherein in a case where it is detectedthat a terminal sound memo is newly generated by the communicationterminal, the control unit controls the first communication unit and thesecond communication unit to start processing of acquiring a capturedimage from the image capturing apparatus and transferring the capturedimage together with the generated terminal sound memo to the server. 7.The communication terminal according to claim 5 further comprising amanagement unit implemented by one or more processors and configured tomanage a transfer history, wherein in a case where management by themanagement unit indicates that a captured image with which a newlygenerated terminal sound memo or camera sound memo is associated hasbeen transferred, the control unit transfers only the generated terminalsound memo or camera sound memo to the server.
 8. The communicationterminal according to claim 5 further comprising a management unitimplemented by one or more processors and configured to manage atransfer history, wherein in a case where management by the managementunit indicates that, with respect to a captured image with which a newlygenerated terminal sound memo or camera sound memo is associated, aterminal sound memo or camera sound memo previously generated has beentransferred, a file name of the newly generated terminal sound memo orcamera sound memo is differentiated from a file name of the sound memothat has been transferred.
 9. The communication terminal according toclaim 2 further comprising a combining unit implemented by one or moreprocessors and configured to, in a case where a captured image withwhich a newly generated terminal sound memo is associated has a camerasound memo generated by the image capturing apparatus, combine theterminal sound memo and the camera sound memo to generate a combinedsound memo.
 10. The communication terminal according to claim 2, whereinin a case of terminating an application that performs processing foracquiring a captured image from the image capturing apparatus andtransferring the captured image to the server, the control unit controlsthe first communication unit and the second communication unit totransfer, to the server, a terminal sound memo and a camera sound memothat have not been transferred yet.
 11. The communication terminalaccording to claim 1, wherein the display unit performs thumbnaildisplay of captured images managed by the image capturing apparatus andacquired by using the first communication unit and enables a user toselect, from the thumbnail display, a captured image for which aterminal sound memo is to be generated.
 12. The communication terminalaccording to claim 11, wherein the display unit displays, on thethumbnail display, an icon indicating that a sound memo exists.
 13. Thecommunication terminal according to claim 12, wherein the display unitdisplays an icon for distinguishing between the terminal sound memo anda camera sound memo.
 14. A method for controlling a communicationterminal that acquires a captured image from an image capturingapparatus and transfers the captured image to a server, the methodcomprising: communicating with the image capturing apparatus, acquiring,from the image capturing apparatus, a captured image managed by theimage capturing apparatus, and displaying the captured image; generatinga terminal sound memo related to the captured image displayed uponacceptance of sound input; storing the generated terminal sound memo inassociation with the captured image; and transferring, to the server,the captured image acquired from the image capturing apparatus and thegenerated terminal sound memo associated with the captured image,wherein, in a case where both a camera sound memo generated by the imagecapturing apparatus and the generated terminal sound memo exist for thecaptured image that is to be transferred, whether to transfer theterminal sound memo together with the captured image to the server or totransfer the camera sound memo together with the captured image to theserver is determined based on a predetermined condition.
 15. Acommunication system comprising: an image capturing apparatus; and acommunication terminal that acquires a captured image from the imagecapturing apparatus and transfers the captured image to a server,wherein the image capturing apparatus includes an image capturing unit,and a management unit configured to manage a captured image obtained bythe image capturing unit, wherein the image capturing unit, and themanagement unit are implemented by one or more processors, and thecommunication terminal includes a first communication unit configured tocommunicate with the image capturing apparatus, a second communicationunit configured to communicate with the server, a display unitconfigured to communicate with the image capturing apparatus by usingthe first communication unit, acquire, from the image capturingapparatus, the captured image managed by the image capturing apparatus,and display the captured image, a sound input unit configured to acceptsound input for generating a terminal sound memo related to the capturedimage displayed by the display unit, a storage unit configured to storethe generated terminal sound memo in association with the capturedimage, and a control unit configured to perform control so that thesecond communication unit transfers, to the server, the captured imageacquired from the image capturing apparatus by using the firstcommunication unit and the generated terminal sound memo associated withthe captured image, wherein, in a case where both a camera sound memogenerated by the image capturing apparatus and the generated terminalsound memo exist for the captured image that is to be transferred, thecontrol unit determines based on a predetermined condition whether totransfer the terminal sound memo together with the captured image to theserver or to transfer the camera sound memo together with the capturedimage to the server, and wherein the first communication unit, thesecond communication unit, the display unit, the sound input unit, thestorage unit, and the control unit are implemented by one or moreprocessors.
 16. A non-transitory computer readable medium storing aprogram causing a computer to function as the communication terminalaccording to claim 1.